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It is a recent realization that many of the concepts and tools of causal discovery in machine 
learning are highly relevant to problems in quantum information, in particular quantum nonlocality. 
The crucial ingredient in the connection between both fields is the tool of Bayesian networks, a 
graphical model used to reason about probabilistic causation. Indeed, Bell's theorem concerns a 
particular kind of a Bayesian network and Bell inequalities are a special case of linear constraints 
following from such models. It is thus natural to look for generalized Bell scenarios involving more 
complex Bayesian networks. The problem, however, relies on the fact that such generalized scenarios 
are characterized by polynomial Bell inequalities and no current method is available to derive them 
beyond very simple cases. In this work, we make a significant step in that direction, providing a 
general and practical method for the derivation of polynomial Bell inequalities in a wide class of 
scenarios, applying it to a few cases of interest. We also show how our construction naturally gives 
rise to a notion of nonsignalling in generalized networks. 


Bell's theorem ji] demonstrates that our classical con¬ 
ceptions of causal relations must be taken with care, 
as they fail to commit with the results obtained in 
some quantum experiments performed by distant par¬ 
ties, the phenomenon known as quantum nonlocality. 
Even without detailed information about the underly¬ 
ing processes, the causal structure of the setup alone 
already implies strong constraints - the famous Bell's 
inequalities - on the correlations that are compatible 
with it. 

This is close to the reasoning employed in the field of 
causal inference Ii 2 l| 3 |/ a connection that has recently at¬ 
tracted considerable attention [gJHlj. Since Bell's theo¬ 
rem is a statement about classical correlations, it comes 
as no surprise that mathematical tools and concepts, 
originally devised in a causal inference context, can also 
be applied to the study of nonlocality. Indeed, Bell's 
theorem concerns the same kind of causal structures 
that are the object of study in Bayesian networks lH and 
Bell inequalities are a special case of linear constraints 
following from such models [gj. Bayesian networks not 
only offer a new conceptual perspective to revisit quan¬ 
tum nonlocality l|8j ITTl but also provide the right lan¬ 
guage to devise generalized Bell scenarios Bil. 

Several extensions of the paradigmatic Bell experi¬ 
ment - two distant parties, performing two possible ex¬ 
periments on their shares of a joint system - have been 
proposed, including more parties [T^ , more mea¬ 
surements/outcomes [ [151 li^ and sequential measure¬ 
ments [TtIIi^ . However, all these different generaliza¬ 
tions share the same basic property: the correlations be¬ 
tween all the parties originate from a single (not directly 
observable) source, being therefore named as local hid¬ 
den variable (LHV) models. In spite of the rich plethora 
of phenomena and applications [ [T^ , LHV models rep¬ 
resent a very particular case of the possibilities offered 
by Bayesian networks. Those t5rpically include several 


independent hidden variables and will be named here 
as generalized local hidden variable (GLHV) models. 
These scenarios with many independent sources are 
also ubiquitous in quantum information, e.g., entangle¬ 
ment percolation ll2oi , entanglement swapping ll2Tl and 
quantum repeaters ll22l[23| . Thus, understanding gener¬ 
alized Bell scenarios is not only of fundamental interest 
but also of high practical relevance. 

Within that context, the basic question to be solved 
is how to derive Bell inequalities for general Bayesian 
networks. Bell inequalities play a fundamental role in 
study of nonlocality, since it is via their violation (e.g. 
with quantum entangled states) that we can witness the 
nonlocal character of a given experimental data. Unfor¬ 
tunately, as opposed to usual Bell scenarios, a GLHV 
model implies a non-convex region - characterized by 
pol5momial Bell inequalities - of correlations that are 
compatible with it. Generally, algebraic geometry meth¬ 
ods can be used to characterize such pol5momial con¬ 
straints [ j2^[25| , but given their computational complex¬ 
ity, in practice they are intractable even for very simple 
models ||^. Arguably, because of this difficulty, only 
sparse results have been obtained in the derivation of 
Bell inequalities for GLHV models, either using coarse¬ 
grained information jg] or considering par¬ 
ticular scenarios However, to our knowledge, 

no practical and systematic method for the derivation 
of polynomial Bell inequalities for GLHV models is 
known to this date. 

In this paper we propose a general method for de¬ 
riving polynomial Bell inequalities in a wide class of 
Bayesian networks. In spite of the non-convex character 
of the problem, we show how to obtain polynomial in¬ 
equalities resorting to a linear programming technique, 
namely a Fourier-Motzkin elimination [3^ . We illus¬ 
trate the general method applying it to a few relevant 
cases and derive new pol5momial Bell inequalities. Fur- 
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FIG. 1 . DAG representation of Bayesian networks, (a) Bipar¬ 
tite LHV model, (b) GLHV model with 2 independent hid¬ 
den variables representing the bilocality scenario of (c) 
GLHV model with 2 hidden variables shared among 4 parties. 


thermore, we explain how our construction naturally 
leads to a notion of nonsignalling correlations Ij^J in 
generalized Bell networks. 


I. BELL INEQUALITIES, BAYESIAN NETWORKS AND 
MARGINAL PROBLEMS 

Bell scenarios beyond LHV models can be repre¬ 
sented via the graphical notation of Bayesian networks 
IE 1 | 3 J- Underlying models are represented by directed 
acyclic graphs (DAG), where nodes stand for variables 
and directed arrows represent their causal relations llzl . 
While LHV models correspond to a DAG with a sin¬ 
gle hidden variable (see Fig. Qa)), GLHV models are 
represented by DAGs with n > 2 independent hid¬ 
den variables (see Fig. [ipb)-(c)). The causal relations 
implied by a DAG are captured by the (conditional) 
independencies (GI) implied by the graph and that 
can be listed by the d-separation criterion JJl- For in¬ 
stance, for the LHV model in Fig. j^a) it follows that 
p{x,y,\) = p{x)p{y)p{A) and p{a\x,y,X) = p(fl|x,A) 
(similarly to b). Thus, any observable data - given 
by the probability distribution p{a,b\x,y) - compatible 
with this LHV model can be decomposed as 

p{a,b\x,y) = J 2 p{a\x,A)p{b\y,\)p{A). (i) 

A 

That is, any local distribution must lie inside the convex 
set defined by 0 ' the so-called correlation polytope C 
[ j^l^ . In this geometric picture, (linear) Bell inequal¬ 
ities are nothing else than facets of C. Given that it is 
easy to list the extremal points of C, to find its facets 
amounts to an efficient linear program, arguably the 


reason why this method has become the most promi¬ 
nent in the study of nonlocality. 

Another equivalent, but far less used method, comes 
from the realization that Bell inequalities are constraints 
arising from a marginal problem [ j37lj^ , that can be 
stated as: given some marginal distributions of n vari¬ 
ables is it possible to find a joint distribution of all 
variables, such that this distribution marginalizes to the 
given ones? To see that Bell's theorem is indeed a par¬ 
ticular marginal problem, notice that the LHV descrip¬ 
tion 0 is equivalent to the existence of a joint distri¬ 
bution p = p{aQ,... ,am^,bQ,... ,bjny) (represented as a 
vector p) describing the probability for outcomes of all 
possible measurements, where labels the outcome a 
given that x = i = { 0 ,..., Mx} and similarly for b. Since 
p defines a valid probability, it is constrained by a set 
of linear inequalities Lp > 0 given by p, > 0 (positiv¬ 
ity) and Pi = 1 (normalization) defining the simplex 
polytope P H^ . Given that at each round of the exper¬ 
iment only one a, and one bj can be measured simul¬ 
taneously, p defines a non-observable quantity. How¬ 
ever, the constraints on p will also imply constraints on 
the level of the observable distributions p(fl,, by). These 
are exactly Bell inequalities, that in this picture can be 
understood as a condition for the marginal problem to 
have a positive answer. Thus, to obtain Bell inequalities 
in this picture, we have to eliminate from our descrip¬ 
tion all non-observable terms. This is a achieved via a 
FM elimination [ [3^ , a standard algorithm for the elim¬ 
ination of variables from a system of inequalities. 

For simplicity and without loss of generality, in the 
remaining of the paper we focus on dichotomic out¬ 
comes (e.g. fl, = 0 , 1 ). It is then convenient to consider 
the equivalent description of the problem in terms of 
the correlation vector E with components given by ex¬ 
pectation values, e.g., {AiBj) = 

The vectors E and p are linearly related as E = T“^p im¬ 
plying that E must obey linear inequalities TE > 0 plus 
a normalization constraint. To illustrate the FM elimi¬ 
nation consider the GHSH scenario where each of 
the two parties in Fig. [^a) can measure two observ¬ 
ables. The inequalities 

(AoBo) + {AiBo) - (AoAi) < 1, (2) 

(AoBi) - (AiBi) + (AoAi) < 1, (3) 

directly follow from TE > 0 after the elimination of 
terms like (AqAiBoSi) and (AqAiBo)- The sum of 0 
and 0 eliminates the remaining non-observable term 
(AgAi) leading exactly to the GHSH inequality [^T] . 


IT POLYNOMIAL BELL INEQUALITIES 


Similarly to LHV models, a GLHV model also im¬ 
plies the existence of a well defined a joint distribu- 
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tion p characterized by linear inequalities Lp > 0 . The 
difference resides on fhe facf fhaf GLHV models also 
imply a sef of non-linear inequalifies Wp > 0 (where 
W = W(p)). Thus, a GLHV model is characferized by 
infersecfion of P wifh Wp > 0 , fhaf is, a semi-algebraic 
sef m- As discussed before, fhis sysfem of inequalifies 
involves non-observable quanfifies fhaf have fo be elim- 
inafed in order fo obfain a descripfion in ferms of em¬ 
pirically accessible variables only Formally, fhe prob¬ 
lem af hand is equivalenf fo a quanfifier eliminafion: 
fhe projecfion of a semi-algebraic sef onto a subspace of 
if, fhaf by Tarski-Seidenberg fheorem is again guaran¬ 
teed fo be a semi-algebraic sef [ p^ . In ofher ferms, fhe 
correlafions compatible wifh a GLHV model are char¬ 
acferized by finifely many pol5momial Bell inequalifies. 
Quanfifier eliminafion is routinely encounfered in alge¬ 
braic geomefry problems, fhus general purpose mefh- 
ods have been developed [ p^ . Unforfunafely, given 
fheir compufafional complexify, fheir applicafion fo Bell 
scenarios is infracfable even for fhe simplesf possible 
models 0- Nofwifhsfanding, we show nexf fhaf a sim¬ 
ple adapfafion of fhe FM eliminafion leads fo practical 
and compufafional fracfable way for deriving pol5mo- 
mial Bell inequalifies. 

The class of DAGs we consider are fhose which dis¬ 
play (conditional) independencies on fhe level of fhe 
joinf disfribufion p. This is fhe case, for insfance, in fhe 
DAG of Fig. Qb) implying fhe independence relafion 
p{a,c) = p{a)p{c) and for many ofher relevanf scenar¬ 
ios in quanfum informafion |j^ |2i] [^ . 

The mefhod fo derive pol5momial inequalifies for fhis 
class of scenarios proceed as follows. 

Given p we firsf need fo lisf all ifs componenfs fhaf 
are fo be eliminated from our descripfion: po and p^o 
sfand, respecfively, fo fhe sef of observable and non¬ 
observable (fo be eliminated) componenfs p,. We also 
lisf all fhe ferms in p^o appearing in a non-linear fash¬ 
ion in Wp > 0 , labeled by pwNO- Notice fhaf all ferms 
appearing in p^o buf nof in pwNO can be eliminated 
via a usual FM eliminafion over Lp > 0 , obfaining a 
new sef of linear relations L'p > 0 . The ferms in pwNO 
have fo be eliminafed considering L'p > 0 and Wp > 0 
joinfly. To fhaf aim, notice fhaf Wp > 0 can be lin¬ 
earized by considering some of fhe variables as free pa- 
ramefers of fhe problem. Given Wp > 0 fhere is going 
fo be a minimum sef of variables P^no need fo be 
sef fo free paramefers in order fo linearize fhe problem. 
This means fhaf we can apply a FM eliminafion fo fhe 
remaining ferms obfaining a final sef of inequalifies fhaf 
will depend linearly on fhe observable ferms po and 
pol5momially on ferms Pwno- observable dafa will 
also imply linear consfrainfs on fhe paramefers P^no' 
Togefher wifh fhese consfrainfs, fhe obfained pol5momi- 
als can be furfher simplified by usual quanfifier elimi¬ 
nafion mefhods, finally arriving af pol5momial inequal¬ 


ifies involving observable dafa only. We highlighf fhaf 
following fhis procedure, one can derive all pol5mo- 
mial Bell inequalifies following from fhe infersecfion of 
Lp > 0 and Wp > 0 , fhaf is, our mefhod provides a full 
characferizafion of fhe GLHV models under consider¬ 
ation. In practice, however, a partial characferizafion 
(e.g. in ferms of full correlators only) will often be fhe 
only compufafionally fracfable approach. 

To illusfrafe fhe general mefhod, we sfarf considering 
fhe bilocalify scenario, one of few cases for which poly¬ 
nomial Bell inequalifies are known I pqllJT] . The sce¬ 
nario involves fhree parties wifh correlafions mediated 
via fwo independenf sources (see Fig.Qb)). In fhe par- 
ficular case of fwo dichotomic measuremenfs per parfy 
fhe following inequalify has been proven fo hold 

ml 

^\l\ + ^\l\< 2 , (4) 

where I = Ex,z=o,i {AxBqCz) and / = 

{^xBiCz)■ However, fhe mefhods 
in [31] cannof be easily generalized fo differenf sce¬ 
narios, for insfance, considering fhree measuremenfs 
per parfy. Nexf we show in defails how our framework 
can be employed fo easily prove 0- We fhen proceed 
fo derive new pol5momial Bell inequalifies. 

To derive Q, we consider fhe independence con- 
sfrainf following from fhe DAG in Fig.jipb): 

(AoAiCoCi) = (AoAi) (CoCi). (5) 

We need fo combine l|^ via a FM eliminafion wifh fhe 
linear inequalifies TE > 0 . If is sufficienf fo consider 
fwo inequalifies following from TE > 0 : 

±l - (AoAi) - (CoCi) - (AoAiCoCi) < 1, (6) 
±/ + (AoAi) + (CoCi) - (AoAiCoCi) < 1. (7) 

Subsfifufing Q in 0 and 0 and affer some algebraic 
manipulations, we can combine bofh inequalifies info a 
single pol5momial inequalify 

2 (AoAi)2 + (±7 ^ 1 ) (AoAi) - (±f ± 7 + 2) < 0. (8) 

As discussed before, we arrive af an inequalify fhaf de¬ 
pends linearly on fhe observable dafa (ferms 1 and ]) 
buf have a non-linear dependence on non-observable 
ferms, in fhis case {AqAi). The minimum of fhe poly¬ 
nomial in is achieved af {AqAi) = (±f 4= 7 )/ 4 , im¬ 
plying fhe inequalify in ferms of observable dafa only 

-(l/8)(±f-T7)"-(±^±7 + 2) <0. (9) 

This is a quadratic equafion fhaf can be easily solved, 
e.g. for 1 , leading exacfly fo Q. 

Anofher nice teafure of our consfrucfion is fhe facf 
fhaf independencies are nof required fo hold exacfly. 


4 


For instance, we may be interested in quantifying how 
much a given constraint must be relaxed in order to 
explain some experimental data lITTl [4^ . In the bilo¬ 
cality scenario if we allow for correlafions C/^c > 

I (AgAiCoCi) — {AqAi) (CqCi) I befween parfs A and C, 
if follows fhaf 

-{l/8)i±I-T})^-i±I±} + 2 ) < 2 Cac, (10) 

fhaf is, fhe violafion of l|^ quanfifies fhe degree of cor- 
relafion required fo classically reproduce some non¬ 
bilocal correlafion. As an illusfration, consider the cor¬ 
relation I = J = 2 that can be achieved quantum me¬ 
chanically with two copies of Bell sfafes shared befween 
fhe parties [31] . In order fo be classically reproduced, 
fhis correlation requires Cac = 1/ that is, maximal cor¬ 
relation between parts A and C. 

To further illustrate the practicality and relevance of 
our mefhod we also derived new pol5momial inequali¬ 
ties. See fhe Appendix for a defailed discussion. For 
fhe considerably more complicafed GLHV model in 
Fig-Qc) fhe inequalify ||^ is also valid if we define new 
functions given by f = — (AiBgCoDo) — (AiBgCoDi) -|- 
{AiBiCqDo) + {AiBiCqDi) and / = {AqBqCiDq) — 
(AoBoCitTi) + (^oFiCiDg) — (AgBiCiDi). Consider¬ 
ing fhe bilocalify scenario in Fig. Qb) wifh 3 measure- 
menf seffings, fhe following inequalify holds: 

- (l/8)(f-/ + 16)2 + 8f < 0, (11) 

wifh I = Lx,z=0,i,2 {AxBqCz) and / = 

■ To show fhe relevance 
of fhis inequalify, notice fhaf wifhouf fhe independence 
consfraint if follows fhaf |f| -F |/| < 10. Choosing a 
correlafion given I = J = 9 v (achievable in quanfum 
mechanics for v < 1/2) we see fhaf only for v > 5/9 
fhe correlation is nonlocal. However, using (11) we 
see that this correlation is non-bilocal for v > 4/9, 
illusfrating the gap between the local and bilocal sets. 

III. NONSIGNALLING CORRELATIONS AND 
GENERALIZED BAYESIAN NETWORKS 

In the study of nonlocalify it is often useful fo de¬ 
fine fhe notion of nonsignalling (NS) correlafions 
These are fhe observable disfribufions po thaf cannof be 
used to signal between the parties, that is, the marginal 
distributions are well defined quanfifies fhaf cannof de¬ 
pend in any way on which observable fhe ofher par¬ 
ties have measured. A paradigmatic example of a NS 
correlafion is fhe Popescu-Rohrlich(PR)-box defined as 
p{a,h\x,y) = l/ 2 Sa(Sib,xy H- 

The marginal problem approach nafurally incorpo- 
rafes fhe nofion of NS correlafions IHI- For insfance, 
for fhe biparfife scenario in Fig. j^a), NS correlafions 


are fhose fhaf have well defined observable disfribu¬ 
fions p{ai,bj)\/i,i (respecting posifivify and normal- 
izafion) and well defined marginals, fhaf is, p{ai) = 
LbjPi^^iYj) = Lbj,P{‘^iYj') V;,/ (and similarly for 
p{bj)). These consfrainfs can be combined info a sysfem 
of linear inequalities LnsPo ^ 0 defining a polyfope 
thaf is characferized by finifely many exfremal poinfs. 

We can define NS correlafions in generalized 
Bayesian networks, as fhose fhaf are compatible wifh 
CLHV models where all fhe underlying (classical) hid¬ 
den variables are replaced by general NS disfribufions. 
As shown in all fhe conditional independencies on 
the level of fhe observable disfribufions po that are 
valid in the classical setup will remain valid after this 
replacement. For instance, for fhe bilocalify scenario 
in Fig. I^b), even if we allow for NS correlafions fo 
be shared befween fhe parties, if is hue fhat fhe sfafis- 
fics befween parties A and C should factorize, fhaf is, 
p(flj,Cfc) = p{ai)p{ck)- As before, we can represenf fhe 
observable independencies as a sysfem of pol5momial 
inequalities Wjvjspo > 0 . Thus, we can define gen¬ 
eralized nonsignalling (CNS) correlafions as fhose in¬ 
side fhe semi-algebraic sef E, defined by intersection of 
LnsPo ^ 0 ^rid WnsPo > 0 . Since E defines a non- 
convex body characferized by pol5momial inequalifies, 
differenfly from fhe usual case, fhere are going fo be in¬ 
finitely many exfremal CNS poinfs defining if. In spife 
of fhaf, we can still define a sensible and practical way 
fo characterize GNS correlafions. Similarly fo whaf has 
been done before, we can fake some of fhe variables 
appearing in W^spo > 0 as free parameters in order 
fo linearize if. Doing fhaf we torn E info a convex sef 
wifh finitely many extremal points that can therefore be 
characferized by sfandard linear program techniques. 

As an illusfration consider the GLHV model in 
Fig. Qb) with all parties performing fwo dichofomic 
measuremenfs. If we fix fhe marginal disfributions 
of parfs A and C fo be p(fl,) = p{cjf) = 1 / 2 , we 
see fhaf one of fhe exfremal GNS poinfs is given 
by p{a,b,c\x,y,z) = l/4(5^a5ba5c,y(:,©z), a disfribufion 
fhaf can be achieved replacing fhe hidden variables in 
Fig.j^b) by PR-boxes 1 ^ . 


IV. DISCUSSION 

Bayesian nefworks offer an almosf unexplored 
ground for generalizations of Bell's theorem. The basic 
question to be solved in this quest is how to derive poly¬ 
nomial Bell inequalities associated with more complex 
causal structures. In this work we made an important 
step in that direction. We proposed a practical and gen¬ 
eral method that can be readily applied to a wide range 
of scenarios, considering ifs applications in few GLHV 
models and deriving pol5momial Bell inequalifies char- 
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acterizing them. We have also shown how our construc¬ 
tion naturally leads to a notion of nonsignalling corre¬ 
lations in GLHV models. 

Given the fundamental role that Bell inequalities play 
in the study and practical applications of nonlocalify 
we believe fhat our resulfs will mofivafe and sef a ba¬ 
sic fool for fufure research in generalized Bell scenarios. 
The nafural nexf sfep is fo puf fhe machinery fo use in 
a variefy of scenarios and derive new Bell inequalities 
well suifed, for example, fo decrease fhe requiremenfs 
on experimenfal implemenfafions of Bell fesfs |jgj. If 
would be inferesfing fo invesfigafe fhe role of pol5mo- 
mial Bell inequalities in practical applications of non¬ 
localify, such as quanfum cr5q)fography Ij45|, random¬ 
ness generafion | |^|^ or disfribufed computing 
For insfance, fhe amounf of violafion of usual Bell in¬ 
equalities can be direcfly associafed wifh fhe probabil- 
ify of success in communicafion complexify problems 
Are fhere any communicafion problems associ¬ 


afed fo pol5momial Bell inequalities? Anofher possibil- 
ify is fo find Tsirelson's bounds ISEII associafed wifh 
fhese generalized inequalifies, fhaf is, whaf is fhe maxi¬ 
mum violafion of fhem achievable wifh quanfum corre- 
lafions. Relafed fo fhaf and inspired by resulfs such as 
information causality it would also be relevant to 
derive information-theoretical principles for fhese more 
complex Bayesian nefworks lITTl . 
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Appendix A: A method for the derivation of polynomial Bell inequalities 


As discussed in the main text, the derivation of pol5momial Bell inequalities follows from an adapfed FM elim- 
inafion over fhe combined sysfem of inequalifies Lp > 0 and Wp > 0 (wifh W = VV(p)), the first representing 
linear relations respected by a well defined probabilify disfribufion p while fhe laffer sfands for fhe (conditional) 
independence (Cl) constraints implied by a given GLHV model. Notice, however, that not all DAGs will display 
CIs on the level of p; fhis is fhe case for insfance in fhe so-called triangle scenario |^ E^E8l . 

Given the scenario of inferesf, we need fo define po and p^o sfanding, respectively, fo fhe sef of componenfs p, 
fhaf we wanf fo keep or nof in our description. We also need fo define pwNO and p(yno' corresponds fo 

fhe componenfs in p^o appearing in a non-linear fashion in fhe inequalities Wp > 0 while the latter describes the 
minimum set of componenfs fhat need fo be faken as free real paramefers in order fo linearize Wp > 0 . All the terms 
in pNo but not in pwNO can be eliminated via a usual FM elimination leading to new set of inequalifies L'p > 0 . To 
undersfand fhe FM eliminafion, notice fhaf since fhe sum of fwo valid inequalifies also defines a valid inequalify 
in order fo eliminafe a given ferm from our descripfion we basically have fo consider all possible pairwise sums of 
inequalifies where fhe coeffidenfs of fhe ferm fo be eliminafed appear wifh opposife signs. The remaining ferms 
have fo be eliminafed resorfing fo fhe adapfed FM mefhod discussed in fhe main fexf and illusfrafed in defails 
below. 

As a side remark, we notice fhaf insfead of performing fhe usual FM eliminafion leading fo L'p > 0 , one can 
equivalenfly lisf fhe exfremal poinfs in fhe subspace given by fhe supporf of L', and fhen dualize fhe descripfion 
in order fo exacfly obfain fhe inequalifies L'p > 0 . In practice which approach will be better is going fo depend 
on the scenario in question. Typically, if fhe number of ferms in po is nof large, fhe dualizafion approach will 
be reasonably fasfer. We refer fhe reader fo Ref. [ [371 for a discussion of fhe compufational advantage of fhe bofh 
mefhods in usual LHV models. In fhe following, for simplicify and wifhouf loss of generalify, we focus on fhe case 
where all measuremenfs have dichotomic oufcomes so fhaf we can equivalenfly freaf fhe problem in ferms of fhe 
correlation vecfor E wifh componenfs given by expecfafion values. 

To illusfrafe fhe absfracf discussion, consider fhe bilocalify scenario (see Fig. i(b) in fhe main fexf) in fhe parficular 
case where all the parties measure two dichotomic observables, implying the bilocality constraint Pac{iio> ^1/ Cq, Ci) = 
Pfl(flO/i?i)Pc(co/Ci). Since the variables are binary, we see that the bilocality assumption is equivalent to 16 (not 
necessarily independent) quadratic constraints. In order to linearize this set of consfrainfs we can fake VaQ,ai = 
Pa {do, di) (for each uq, a\ = 0,1) as a free real paramefer, fhaf is, we can express fhe non-linear consfrainfs W(p)p > 
0 as a linear relation W(uo,0/^o,i/^i,0/^i,i)p ^ 0- Iri ferms of expecfafion values, we have a correlation vecfor 
E = ( 1 , Lq, Eq, EqCi/ • • • / ^AqAiBoBiCqCi) (for simplicify we label (X) = Ex) wifh 64 componenfs fhaf musf respecf 
fhe linear consfrainfs TE > 0 (wifh E = T“^p). The bilocalify consfrainfs can also be expressed in ferms of 
expecfafion values. For insfance, pac{0,0,0,0) = pa{0,0)pc{0,0) is equivalenf fo 


^(1 + Eq + Eco + ^CoQ) + ^AqAi + EAq + EAj + ^AqCq + ^AqCi + ^AiCq + 
T^AjCflCi ^AqCqCi “F ^ApAjCi ”F EaqAiCq ”F ^AqAiCqC] 0 


(Al) 


wifh V = 1 — 4 pa ( 0 , 0 ), where pa ( 0 , 0 ) is fhe real free paramefer. Notice fhaf fhe bilocalify consfrainfs do nof depend 
on fhe variables Bq and Bi. Therefore all non-observable ferms fhaf depend on fhem, for insfance, 
can be eliminafed via fhe usual FM eliminafion mefhod over TE > 0 , defining a new sysfem of linear inequalifies 
T'E > 0 . The remaining ferms fo be eliminafed are fhose fhaf depend joinfly on Aq,Ai and/or Cq, Ci. We 
furfher notice fhaf all fhe bilocal consfrainfs (e.g. I Ail) only have a non-linear dependence on fhe ferms Eq^, 
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Eci EqCi- That is, all the non-observable terms but EcoQ can be eliminated via the usual FM elimination. 
After all non-observable terms have been eliminated we arrive at a final descripfion fhaf depends linearly on the 
observable terms and non-linearly on the free parameters Notice that the observable data will also imply 

linear constraints on the free paramefers themselves. Together with these constraints, each of obfained pol5momial 
inequalities can be further simplified by usual quanfifier elimination methods (see for insfance fhe function Reduce 
in Mathematica I|53j), finally arising af pol5momial inequalifies involving fhe observable dafa only. 

In fhe following we will apply fhe general mefhod fo each of fhe scenarios in Fig. i of fhe main fexf. For 
compufafional reasons we consider fhe case wifh full correlafors buf no marginal ferms, fhaf is, we keep ferms like 
{AjEjCif) buf nof ferms like or (A,). Nofwifhsfanding, as highlighfed in fhe main fexf, our mefhod can also 

be applied fo obfain a full characferization of fhe GLHV models, fhat is, including marginal ferms. 


Appendix B: Detailed derivation of the polynomial Bell inequalities 


We start considering the bilocality scenario in Fig. i(b) in the case where each party A,B and C can measure two 
dichotomic observables. We highlight that the same analysis remains valid if we consider parf B fo perform a single 
measuremenf with 4 possible outcomes, e.g., a measurement in the Bell basis. Restricting to the subspace of full 
correlafors -fhaf is, confaining ferms (AjByCjt) - we observe thaf one of the obtained inequalities is exactly eq. (8) 
of the main text. That is, this inequality corresponds to a facet of fhe bilocal set in the subspace of full correlafors. 
We further notice that in order to derive this class of inequalifies if is sufficienf fo consider - insfead of fhe full sef 
of bilocal consfraints - the simple constraint 


(AoAiCoCi) = (AoAi) (CoCi) . (Bi) 

Together with the linear constraints 

±I - (AoAi) - (CoCi) - (AoAiCoCi) < 1, (B2) 

±/ + (AqAi) + (CoCi) - (AoAiCoCi) < 1, (B3) 


we can readily prove inequality eq. (8) of fhe main texf. Inequalifies | [B2 
^ = 'Lx,z {^xBoCz) and / = (A^BiCz). Subsfifuting ||^ in 


an d (B31 directly follow from TE > 0 with 
|B^ and |B^ we obtain 


±f - (1 + (AoAi)) - (CoCi) (1 + (AoAi)) < 0, (B4) 

±7 - (1 - (AqAi)) + (CoQ) (1 - (AqAi)) < 0. (B 5) 


Notice that (1 ± (AqAi)) > 0, with equality only if (AqAi) = 3 =1, fhaf is, only if bofh oufpufs of parf A are 
determinisfic funcfions, fhaf is, only if we have fhe frivial case where either f = 0 or / = 0. For (1 ± (AgAi)) > 0, 
we can rearrange I jBq} and <I1D as 

±f/(l + (AoAi))-l - (CoCi) <0, 

±//(l-(AoAi))-l + (CoCi) <0. 

Summing both inequalities we eliminate the term (CqCi) and arrive at 

±f/(l + (AoAi))±//(l-(AoAi)) <2, (B8) 


(B6) 

(B7) 


that can be further arranged to obtain the class of inequalities discussed in the main text, given by 

2 {AoA,f + (±7 T I) (AoAi) - (2 ± f ± 7) < 0. 


(B9) 


As discussed before, we arrive af an inequalify fhaf depends linearly on the observable data (terms I and J) but 
have a non-linear dependence on the non-observable term (AqAi). Given I and J, to check that this data fulfills the 
inequality, we have to prove that there is at least one choice of (AqAi) such fhat the Ihs of I jBq^ is < 0 . Thaf is, fo 
make use of I jBg) we have to find fhe value of (AqAi) (as a funcfion of I and J) minimizing fhe pol5momial on fhe Ihs. 
Since the Ihs in i jBql defines a convex funcfion, the minimum of the inequality is achieved at {AqAi) = (±J 3= J) / 4 , 
implying the inequality in terms of observable data only. 

-(l/8)(±fT7)^-(2±f±7) <0. 


(Bio) 
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A similar derivation is possible if we allow for correlafion between parfs A and C, such fhaf | (AqAiCqCi) — 
(AqAi) (CqCi) I < Cac- Summing (AqAiCqCi) — {AqAi) (CqCi) < Cac wifh | |B2| and ||B^ we obfain 

(Bn) 
(Bi2) 


±f - (1 + (AoAi)) - (CoCi) (1 + (AoAi)) < Cac, 
±/ - (1 - (AoAi)) + (CoCi) (1 - (AoAi)) < C^c- 

Proceeding with the exact same steps as above we finally obfain 

- (l/8)(±f 2 Cac- 


(B13) 


To prove fhat a similar inequalify holds for fhe GLHV model in Fig. i(c) we can follow a very similar derivation. 
For fhis model if follows fhe independence consfrainf 


(BoBiDoDi) = (BoBi)(DoDi). 


Togefher wifh the linear constraints 


±1 — {BqBi) + (DoDi) + (BoBiDoDj) < 1, 

±/+ (BoBi) - (DoDi) + (BoBiDoDi) < 1, 


(B14) 


(B15) 

(B16) 


where I = — (AiBoCoDq) — (AiBoCqDi) + (AiBiCoDg) + (AiBiCoDi) and / = + (AqBoCiDo) — (AoBoCiDi) + 
(AqBiCiDo) — (AqBiCiDi), we can follow fhe same sfeps as above fo prove fhaf < Bio} also holds in fhis scenario. 

We now move fo fhe scenario in Fig. i(b) where parties A and C can measure fhree possible observables. To 
prove fhaf inequalify (11) of fhe main fext holds in fhis case, we need fo proceed as follows. We have fo consider 
the inequalities 


4 / - 3 (/i) + (/iCoCi) - (/1C0C2) + (/1C1C2) < 0 , 
41 - 3 (/2) - (/2C0C1) - {/2C0C2) - (/2C1C2) < 0 , 


(B17) 

(B18) 


that follow from TE > 0 wifh /j = +3 — AqA^ + AqA2 — A1A2, /2 = 3 + AqA^ + AqA2 + A1A2. If also follows 
from TE > 0 fhat 


Using the independence relation 


3/2 > 2 \I\, 

3/1 > 2 |/|, 

/i + /2 < 8. 


{AiAjC^Ci) = {A,A^) {CuCi)yi,j,k,l 


we can rewrite IB17I and (B181 as 


4 /+ (/i) (-3 + (CoQ) - (C0C2) + (C1C2)) < 0 , 
4 f + (/2) (-3 - (CoCi) - (C0C2) - (C1C2)) < 0 . 


(B19) 

(B20) 

(B21) 


(B22) 

(B23) 

(B24) 


Since ( Big} and | B2o} imply that fi and /2 are strictly positive quantifies (aparf from fhe frivial case 1 = 0 and/or 
/ = 0), we can rewrife fhese inequalities as 


4///1 + (-3 + (CoCi) - (C0C2) + (C1C2)) < 0 , 
4 , 1 /fi + (-3 - (CoQ) - (C0C2) - (C1C2)) < 0 . 

Summing bofh inequalities we eliminafe fhe ferms (CqCi) and (C1C2), obfaining 

4f//2+4 ///i-2 (CoC2)) <6. 

Combining if wifh the trivial inequality (C0C2) < 1 , we finally obfain 


(B25) 

(B26) 

(B27) 


(B28) 


(B29) 


f/i + 7/2 <2/1/2. 
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Since i Big} implies that ±1 — 2/2 is a strictly negative quantity (apart from the trivial case |f| = 2/2), we can 
rewrite the inequality above as 


-fi-Jfi/{1-2/2) < 0 . 


(B30) 


Summing it with we obtain 


/2-//2/a-2/2) <8, 


(B31) 


that can be rewritten as 


2/|+/2(-I + /- 16 )+ 8 I< 0 . (B32) 

The Ihs is a quadratic equation on the non-observable term that is convex, implying that the minimum of fhe Ihs is 
obfained af /2 = {+I — J + 16)/4 and fherefore 

- (l/8)(+f-/ + 16)2 + 81 < 0 . (B33) 









